Overview
Brought to you by YData
Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 300153 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 27.5 MiB |
| Average record size in memory | 96.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 2 |
airline is highly overall correlated with flight | High correlation |
class is highly overall correlated with df_index and 1 other fields | High correlation |
df_index is highly overall correlated with class and 1 other fields | High correlation |
duration is highly overall correlated with stops | High correlation |
flight is highly overall correlated with airline | High correlation |
price is highly overall correlated with class and 1 other fields | High correlation |
stops is highly overall correlated with duration | High correlation |
stops is highly imbalanced (50.6%) | Imbalance |
df_index is uniformly distributed | Uniform |
df_index has unique values | Unique |
airline has 16098 (5.4%) zeros | Zeros |
source_city has 52061 (17.3%) zeros | Zeros |
departure_time has 47794 (15.9%) zeros | Zeros |
arrival_time has 38139 (12.7%) zeros | Zeros |
destination_city has 51068 (17.0%) zeros | Zeros |
Reproduction
| Analysis started | 2025-08-02 10:02:18.607105 |
|---|---|
| Analysis finished | 2025-08-02 10:02:39.134888 |
| Duration | 20.53 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
df_index
Real number (ℝ)
High correlation  Uniform  Unique 
| Distinct | 300153 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 150076 |
| Minimum | 0 |
|---|---|
| Maximum | 300152 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 15007.6 |
| Q1 | 75038 |
| median | 150076 |
| Q3 | 225114 |
| 95-th percentile | 285144.4 |
| Maximum | 300152 |
| Range | 300152 |
| Interquartile range (IQR) | 150076 |
Descriptive statistics
| Standard deviation | 86646.852 |
|---|---|
| Coefficient of variation (CV) | 0.57735315 |
| Kurtosis | -1.2 |
| Mean | 150076 |
| Median Absolute Deviation (MAD) | 75038 |
| Skewness | 0 |
| Sum | 4.5045762 × 1010 |
| Variance | 7.507677 × 109 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 300152 | 1 | < 0.1% |
| 0 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| Other values (300143) | 300143 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 300152 | 1 | |
| 300151 | 1 | |
| 300150 | 1 | |
| 300149 | 1 | |
| 300148 | 1 | |
| 300147 | 1 | |
| 300146 | 1 | |
| 300145 | 1 | |
| 300144 | 1 | |
| 300143 | 1 |
airline
Real number (ℝ)
High correlation  Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.1048732 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 16098 |
| Zeros (%) | 5.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.8332648 |
|---|---|
| Coefficient of variation (CV) | 0.59044756 |
| Kurtosis | -1.5920362 |
| Mean | 3.1048732 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.21131816 |
| Sum | 931937 |
| Variance | 3.3608598 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 127859 | |
| 1 | 80892 | |
| 3 | 43120 | 14.4% |
| 2 | 23173 | 7.7% |
| 0 | 16098 | 5.4% |
| 4 | 9011 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 16098 | 5.4% |
| 1 | 80892 | |
| 2 | 23173 | 7.7% |
| 3 | 43120 | 14.4% |
| 4 | 9011 | 3.0% |
| 5 | 127859 |
| Value | Count | Frequency (%) |
| 5 | 127859 | |
| 4 | 9011 | 3.0% |
| 3 | 43120 | 14.4% |
| 2 | 23173 | 7.7% |
| 1 | 80892 | |
| 0 | 16098 | 5.4% |
flight
Real number (ℝ)
High correlation 
| Distinct | 1561 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1088.3385 |
| Minimum | 0 |
|---|---|
| Maximum | 1560 |
| Zeros | 51 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 258 |
| Q1 | 783 |
| median | 1142 |
| Q3 | 1486 |
| 95-th percentile | 1547 |
| Maximum | 1560 |
| Range | 1560 |
| Interquartile range (IQR) | 703 |
Descriptive statistics
| Standard deviation | 426.69135 |
|---|---|
| Coefficient of variation (CV) | 0.39205757 |
| Kurtosis | -0.69662678 |
| Mean | 1088.3385 |
| Median Absolute Deviation (MAD) | 347 |
| Skewness | -0.60046202 |
| Sum | 3.2666806 × 108 |
| Variance | 182065.51 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1442 | 3235 | 1.1% |
| 1454 | 2741 | 0.9% |
| 1445 | 2650 | 0.9% |
| 1490 | 2542 | 0.8% |
| 1477 | 2468 | 0.8% |
| 1483 | 2440 | 0.8% |
| 1518 | 2423 | 0.8% |
| 1486 | 2404 | 0.8% |
| 1481 | 2335 | 0.8% |
| 1508 | 2329 | 0.8% |
| Other values (1551) | 274586 |
| Value | Count | Frequency (%) |
| 0 | 51 | |
| 1 | 39 | |
| 2 | 5 | < 0.1% |
| 3 | 49 | |
| 4 | 20 | < 0.1% |
| 5 | 5 | < 0.1% |
| 6 | 94 | |
| 7 | 51 | |
| 8 | 91 | |
| 9 | 48 |
| Value | Count | Frequency (%) |
| 1560 | 1266 | |
| 1559 | 1024 | |
| 1558 | 1273 | |
| 1557 | 911 | |
| 1556 | 1381 | |
| 1555 | 1012 | |
| 1554 | 1002 | |
| 1553 | 996 | |
| 1552 | 924 | |
| 1551 | 782 |
source_city
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5775921 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 52061 |
| Zeros (%) | 17.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.7517617 |
|---|---|
| Coefficient of variation (CV) | 0.67961169 |
| Kurtosis | -1.2902317 |
| Mean | 2.5775921 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.033005957 |
| Sum | 773672 |
| Variance | 3.0686691 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 61343 | |
| 5 | 60896 | |
| 0 | 52061 | |
| 4 | 46347 | |
| 3 | 40806 | |
| 1 | 38700 |
| Value | Count | Frequency (%) |
| 0 | 52061 | |
| 1 | 38700 | |
| 2 | 61343 | |
| 3 | 40806 | |
| 4 | 46347 | |
| 5 | 60896 |
| Value | Count | Frequency (%) |
| 5 | 60896 | |
| 4 | 46347 | |
| 3 | 40806 | |
| 2 | 61343 | |
| 1 | 38700 | |
| 0 | 52061 |
departure_time
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.4173372 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 47794 |
| Zeros (%) | 15.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.7542762 |
|---|---|
| Coefficient of variation (CV) | 0.72570604 |
| Kurtosis | -1.4218339 |
| Mean | 2.4173372 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.14775021 |
| Sum | 725571 |
| Variance | 3.0774849 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 71146 | |
| 1 | 66790 | |
| 2 | 65102 | |
| 5 | 48015 | |
| 0 | 47794 | |
| 3 | 1306 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 47794 | |
| 1 | 66790 | |
| 2 | 65102 | |
| 3 | 1306 | 0.4% |
| 4 | 71146 | |
| 5 | 48015 |
| Value | Count | Frequency (%) |
| 5 | 48015 | |
| 4 | 71146 | |
| 3 | 1306 | 0.4% |
| 2 | 65102 | |
| 1 | 66790 | |
| 0 | 47794 |
stops
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| 0 | |
|---|---|
| 2 | |
| 1 | 13286 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 250863 | |
| 2 | 36004 | 12.0% |
| 1 | 13286 | 4.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 250863 | |
| 2 | 36004 | 12.0% |
| 1 | 13286 | 4.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 250863 | |
| 2 | 36004 | 12.0% |
| 1 | 13286 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 300153 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 250863 | |
| 2 | 36004 | 12.0% |
| 1 | 13286 | 4.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 300153 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 250863 | |
| 2 | 36004 | 12.0% |
| 1 | 13286 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 300153 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 250863 | |
| 2 | 36004 | 12.0% |
| 1 | 13286 | 4.4% |
arrival_time
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0740855 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 38139 |
| Zeros (%) | 12.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.7416663 |
|---|---|
| Coefficient of variation (CV) | 0.56656404 |
| Kurtosis | -1.1530744 |
| Mean | 3.0740855 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.40287425 |
| Sum | 922696 |
| Variance | 3.0334016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 91538 | |
| 2 | 78323 | |
| 4 | 62735 | |
| 0 | 38139 | |
| 1 | 15417 | 5.1% |
| 3 | 14001 | 4.7% |
| Value | Count | Frequency (%) |
| 0 | 38139 | |
| 1 | 15417 | 5.1% |
| 2 | 78323 | |
| 3 | 14001 | 4.7% |
| 4 | 62735 | |
| 5 | 91538 |
| Value | Count | Frequency (%) |
| 5 | 91538 | |
| 4 | 62735 | |
| 3 | 14001 | 4.7% |
| 2 | 78323 | |
| 1 | 15417 | 5.1% |
| 0 | 38139 |
destination_city
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5883033 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 51068 |
| Zeros (%) | 17.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.7445431 |
|---|---|
| Coefficient of variation (CV) | 0.67401032 |
| Kurtosis | -1.2904096 |
| Mean | 2.5883033 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.054994808 |
| Sum | 776887 |
| Variance | 3.0434307 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 59097 | |
| 2 | 57360 | |
| 0 | 51068 | |
| 4 | 49534 | |
| 3 | 42726 | |
| 1 | 40368 |
| Value | Count | Frequency (%) |
| 0 | 51068 | |
| 1 | 40368 | |
| 2 | 57360 | |
| 3 | 42726 | |
| 4 | 49534 | |
| 5 | 59097 |
| Value | Count | Frequency (%) |
| 5 | 59097 | |
| 4 | 49534 | |
| 3 | 42726 | |
| 2 | 57360 | |
| 1 | 40368 | |
| 0 | 51068 |
class
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 206666 | |
| 0 | 93487 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 206666 | |
| 0 | 93487 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 206666 | |
| 0 | 93487 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 300153 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 206666 | |
| 0 | 93487 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 300153 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 206666 | |
| 0 | 93487 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 300153 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 206666 | |
| 0 | 93487 |
duration
Real number (ℝ)
High correlation 
| Distinct | 289 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.138864 |
| Minimum | 2.17 |
|---|---|
| Maximum | 25.92 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 2.17 |
|---|---|
| 5-th percentile | 2.17 |
| Q1 | 6.83 |
| median | 11.25 |
| Q3 | 16.17 |
| 95-th percentile | 25.92 |
| Maximum | 25.92 |
| Range | 23.75 |
| Interquartile range (IQR) | 9.34 |
Descriptive statistics
| Standard deviation | 6.9188797 |
|---|---|
| Coefficient of variation (CV) | 0.56997752 |
| Kurtosis | -0.68168961 |
| Mean | 12.138864 |
| Median Absolute Deviation (MAD) | 4.67 |
| Skewness | 0.4788326 |
| Sum | 3643516.5 |
| Variance | 47.870896 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.17 | 17393 | 5.8% |
| 25.92 | 15206 | 5.1% |
| 2.25 | 4036 | 1.3% |
| 2.75 | 2879 | 1.0% |
| 2.83 | 2323 | 0.8% |
| 12 | 2224 | 0.7% |
| 2.33 | 2053 | 0.7% |
| 7.58 | 2045 | 0.7% |
| 8 | 2040 | 0.7% |
| 11.17 | 1999 | 0.7% |
| Other values (279) | 247955 |
| Value | Count | Frequency (%) |
| 2.17 | 17393 | |
| 2.25 | 4036 | 1.3% |
| 2.33 | 2053 | 0.7% |
| 2.42 | 1252 | 0.4% |
| 2.5 | 1418 | 0.5% |
| 2.58 | 1166 | 0.4% |
| 2.67 | 1564 | 0.5% |
| 2.75 | 2879 | 1.0% |
| 2.83 | 2323 | 0.8% |
| 2.92 | 1430 | 0.5% |
| Value | Count | Frequency (%) |
| 25.92 | 15206 | |
| 25.83 | 491 | 0.2% |
| 25.75 | 553 | 0.2% |
| 25.67 | 653 | 0.2% |
| 25.58 | 459 | 0.2% |
| 25.5 | 694 | 0.2% |
| 25.42 | 515 | 0.2% |
| 25.33 | 532 | 0.2% |
| 25.25 | 499 | 0.2% |
| 25.17 | 609 | 0.2% |
days_left
Real number (ℝ)
| Distinct | 49 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.004751 |
| Minimum | 1 |
|---|---|
| Maximum | 49 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 15 |
| median | 26 |
| Q3 | 38 |
| 95-th percentile | 47 |
| Maximum | 49 |
| Range | 48 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 13.561004 |
|---|---|
| Coefficient of variation (CV) | 0.52148178 |
| Kurtosis | -1.1562147 |
| Mean | 26.004751 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.03546435 |
| Sum | 7805404 |
| Variance | 183.90082 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25 | 6633 | 2.2% |
| 18 | 6602 | 2.2% |
| 39 | 6593 | 2.2% |
| 32 | 6585 | 2.2% |
| 26 | 6573 | 2.2% |
| 24 | 6542 | 2.2% |
| 19 | 6537 | 2.2% |
| 31 | 6534 | 2.2% |
| 33 | 6532 | 2.2% |
| 40 | 6531 | 2.2% |
| Other values (39) | 234491 |
| Value | Count | Frequency (%) |
| 1 | 1927 | 0.6% |
| 2 | 4026 | |
| 3 | 4248 | |
| 4 | 5077 | |
| 5 | 5392 | |
| 6 | 5740 | |
| 7 | 5703 | |
| 8 | 5767 | |
| 9 | 5665 | |
| 10 | 5822 |
| Value | Count | Frequency (%) |
| 49 | 6154 | |
| 48 | 6078 | |
| 47 | 6069 | |
| 46 | 6160 | |
| 45 | 6314 | |
| 44 | 6436 | |
| 43 | 6472 | |
| 42 | 6497 | |
| 41 | 6525 | |
| 40 | 6531 |
price
Real number (ℝ)
High correlation 
| Distinct | 10846 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20493.977 |
| Minimum | 2436 |
|---|---|
| Maximum | 63277 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 2436 |
|---|---|
| 5-th percentile | 2436 |
| Q1 | 4783 |
| median | 7425 |
| Q3 | 42521 |
| 95-th percentile | 63277 |
| Maximum | 63277 |
| Range | 60841 |
| Interquartile range (IQR) | 37738 |
Descriptive statistics
| Standard deviation | 21746.461 |
|---|---|
| Coefficient of variation (CV) | 1.0611148 |
| Kurtosis | -0.83281256 |
| Mean | 20493.977 |
| Median Absolute Deviation (MAD) | 3929 |
| Skewness | 0.95491653 |
| Sum | 6.1513286 × 109 |
| Variance | 4.7290858 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2436 | 15066 | 5.0% |
| 63277 | 15017 | 5.0% |
| 54608 | 1445 | 0.5% |
| 54684 | 1390 | 0.5% |
| 60978 | 1383 | 0.5% |
| 60508 | 1230 | 0.4% |
| 49725 | 1205 | 0.4% |
| 51707 | 1205 | 0.4% |
| 5949 | 1196 | 0.4% |
| 49613 | 1150 | 0.4% |
| Other values (10836) | 259866 |
| Value | Count | Frequency (%) |
| 2436 | 15066 | |
| 2437 | 21 | < 0.1% |
| 2438 | 37 | < 0.1% |
| 2447 | 18 | < 0.1% |
| 2449 | 11 | < 0.1% |
| 2456 | 53 | < 0.1% |
| 2463 | 30 | < 0.1% |
| 2464 | 307 | 0.1% |
| 2465 | 50 | < 0.1% |
| 2468 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 63277 | 15017 | |
| 63269 | 4 | < 0.1% |
| 63226 | 15 | < 0.1% |
| 63218 | 24 | < 0.1% |
| 63163 | 49 | < 0.1% |
| 63151 | 1 | < 0.1% |
| 63121 | 2 | < 0.1% |
| 63065 | 2 | < 0.1% |
| 63053 | 19 | < 0.1% |
| 63027 | 4 | < 0.1% |
Interactions
Correlations
| airline | arrival_time | class | days_left | departure_time | destination_city | df_index | duration | flight | price | source_city | stops | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| airline | 1.000 | 0.036 | 0.449 | -0.009 | 0.054 | -0.030 | 0.186 | 0.015 | 0.703 | 0.290 | -0.027 | 0.174 |
| arrival_time | 0.036 | 1.000 | 0.106 | -0.004 | -0.057 | -0.029 | 0.027 | 0.019 | 0.046 | 0.040 | 0.044 | 0.066 |
| class | 0.449 | 0.106 | 1.000 | 0.020 | 0.070 | 0.028 | 0.976 | 0.186 | 0.425 | 0.984 | 0.028 | 0.132 |
| days_left | -0.009 | -0.004 | 0.020 | 1.000 | -0.002 | -0.005 | 0.014 | -0.033 | -0.001 | -0.266 | -0.004 | 0.016 |
| departure_time | 0.054 | -0.057 | 0.070 | -0.002 | 1.000 | 0.001 | 0.084 | 0.118 | 0.076 | 0.055 | -0.009 | 0.077 |
| destination_city | -0.030 | -0.029 | 0.028 | -0.005 | 0.001 | 1.000 | 0.021 | -0.003 | -0.062 | 0.012 | -0.223 | 0.102 |
| df_index | 0.186 | 0.027 | 0.976 | 0.014 | 0.084 | 0.021 | 1.000 | 0.182 | 0.178 | 0.658 | -0.092 | 0.118 |
| duration | 0.015 | 0.019 | 0.186 | -0.033 | 0.118 | -0.003 | 0.182 | 1.000 | 0.195 | 0.318 | 0.007 | 0.665 |
| flight | 0.703 | 0.046 | 0.425 | -0.001 | 0.076 | -0.062 | 0.178 | 0.195 | 1.000 | 0.319 | 0.026 | 0.168 |
| price | 0.290 | 0.040 | 0.984 | -0.266 | 0.055 | 0.012 | 0.658 | 0.318 | 0.319 | 1.000 | 0.014 | 0.280 |
| source_city | -0.027 | 0.044 | 0.028 | -0.004 | -0.009 | -0.223 | -0.092 | 0.007 | 0.026 | 0.014 | 1.000 | 0.063 |
| stops | 0.174 | 0.066 | 0.132 | 0.016 | 0.077 | 0.102 | 0.118 | 0.665 | 0.168 | 0.280 | 0.063 | 1.000 |
Missing values
Sample
| df_index | airline | flight | source_city | departure_time | stops | arrival_time | destination_city | class | duration | days_left | price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 4 | 1408 | 2 | 2 | 2 | 5 | 5 | 1 | 2.17 | 1 | 5953 |
| 1 | 1 | 4 | 1387 | 2 | 1 | 2 | 4 | 5 | 1 | 2.33 | 1 | 5953 |
| 2 | 2 | 0 | 1213 | 2 | 1 | 2 | 1 | 5 | 1 | 2.17 | 1 | 5956 |
| 3 | 3 | 5 | 1559 | 2 | 4 | 2 | 0 | 5 | 1 | 2.25 | 1 | 5955 |
| 4 | 4 | 5 | 1549 | 2 | 4 | 2 | 4 | 5 | 1 | 2.33 | 1 | 5955 |
| 5 | 5 | 5 | 1541 | 2 | 4 | 2 | 0 | 5 | 1 | 2.33 | 1 | 5955 |
| 6 | 6 | 5 | 1533 | 2 | 4 | 2 | 4 | 5 | 1 | 2.17 | 1 | 6060 |
| 7 | 7 | 5 | 1543 | 2 | 0 | 2 | 2 | 5 | 1 | 2.17 | 1 | 6060 |
| 8 | 8 | 2 | 1013 | 2 | 1 | 2 | 4 | 5 | 1 | 2.17 | 1 | 5954 |
| 9 | 9 | 2 | 1014 | 2 | 0 | 2 | 2 | 5 | 1 | 2.25 | 1 | 5954 |
| df_index | airline | flight | source_city | departure_time | stops | arrival_time | destination_city | class | duration | days_left | price | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 300143 | 300143 | 1 | 722 | 1 | 1 | 0 | 5 | 3 | 0 | 17.42 | 49 | 51345 |
| 300144 | 300144 | 1 | 761 | 1 | 2 | 0 | 4 | 3 | 0 | 18.92 | 49 | 51345 |
| 300145 | 300145 | 1 | 716 | 1 | 4 | 0 | 4 | 3 | 0 | 23.08 | 49 | 51345 |
| 300146 | 300146 | 1 | 722 | 1 | 1 | 0 | 4 | 3 | 0 | 25.92 | 49 | 51345 |
| 300147 | 300147 | 1 | 776 | 1 | 1 | 0 | 5 | 3 | 0 | 17.25 | 49 | 63277 |
| 300148 | 300148 | 5 | 1477 | 1 | 4 | 0 | 2 | 3 | 0 | 10.08 | 49 | 63277 |
| 300149 | 300149 | 5 | 1481 | 1 | 0 | 0 | 5 | 3 | 0 | 10.42 | 49 | 63277 |
| 300150 | 300150 | 5 | 1486 | 1 | 1 | 0 | 5 | 3 | 0 | 13.83 | 49 | 63277 |
| 300151 | 300151 | 5 | 1483 | 1 | 1 | 0 | 2 | 3 | 0 | 10.00 | 49 | 63277 |
| 300152 | 300152 | 5 | 1477 | 1 | 4 | 0 | 2 | 3 | 0 | 10.08 | 49 | 63277 |